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(57) ABSTRACT 

A coder for use in encoding and decoding a data set 
representing an image includes a first device which parti- 
tions the data transformation into first and second sets, 
which adds the first set into a list of insignificant sets (LIS), 
and which initializes a list of significant pixels (LSP), a 
second device which tests the first and second sets for 
significance with respect to a threshold value, which parti- 
tions significant members of the first and second sets in 
accordance with first and second partitioning functions, 
respectively, and which adds significant pixels to the LSP, a 
third device which refines the quantization of the pixels in 
the LSP. In operation, the threshold value is decrement as the 
second and third devices are operation in seriatim until the 
coding endpoint has been reached. An optional fourth device 
entropy codes a significance map cooperatively generated by 
the second and third devices. Methods for encoding and 
decoding the data transformation of a data set and a com- 
puter program for converting a general purpose computer 
into a hierarchical image coder are also described. 

14 Claims, 9 Drawing Sheets 
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EMBEDDED AND EFFICIENT LOW- 
COMPLEXITY HIERARCHICAL IMAGE 
CODER AND CORRESPONDING METHODS 
THEREFOR 

5 

BACKGROUND OF THE INVENTION 

The present invention relates generally to image coders 
and/or decoders (hereinafter simply coders) and correspond- 
ing methods. More specifically, the present invention relates 10 
to an embedded and highly efficient low-complexity hierar- 
chical image coder and corresponding methods therefor. 
Advantageously, a corresponding software program for con- 
verting a general-purpose computer into an embedded and 
highly efficient low-complexity hierarchical image coder is 15 
also disclosed. 

Image coding utilizing scalar quantization on hierarchical 
structures of transformed images has proven to be a very 
effective and computationally simple technique. Shapiro 
was the first to describe such a technique with his Embedded 20 
Zerotree Wavelet (EZW) algorithm in his paper entitled 
"Embedded Image Coding Using Zerotrees of Wavelet 
Coefficients (IEEE Trans, Signal Processing 41, pp. 
3445-3462 (December 1993)). Different variants of this 
technique have appeared in the literature which provide an 25 
improvement over the initial work. Said & Pearlman, in two 
papers entitled "An Improved Zero-tree Method for Image 
Compression (IPL TR-122, ECSE Dept., Rensselaer Poly- 
technic. Inst., Troy, N.Y. (November 1992))" and "Image 
Compression Using the Spatial -orientation Tree (IEEE Int. 30 
Symposium on Circuits and Systems, Chicago, 111., pp. 
279-282 (May 1993))," successively improved the EZW 
algorithm by extending this coding scheme. Moreover, Said 
& Pearlman succeeded in presenting a different implemen- 
tation based on a set -partitioning sorting algorithm, as 35 
described in "A New, Fast and Efficient Image Codec Based 
on Set Partitioning in Hierarchical Trees (IEEE Trans. 
Circuits and Systems for Video Technology, vol. 6(3), pp. 
243-250 (June 1996))." This new coding scheme, called the 
Set-Partitioning in Hierarchical Trees (SPIHT), provided an 40 
even better performance than the improved version of EZW. 

It will be noted that all of these scalar quantized schemes 
employ some kind of significance testing of sets or groups 
of pixels, i.e., the set is tested to determine whether the 45 
maximum magnitude in the set is above a predetermined 
threshold. The results of these significance tests determine 
the path taken by the coder in encoding the source samples. 
These significance testing schemes are all based on some 
very simple principles which allow them to exhibit excellent 5Q 
performance which include, among others: 

(1) the significance testing scheme provides for the partial 
ordering of magnitude coefficients with a set- 
partitioning sorting algorithm; 

(2) the significance testing scheme provides for bit plane 55 
transmission in decreasing bit plane order; and 

(3) the significance testing scheme permits exploitation of 
self-similarity across different scales of an image wave- 
let transform. 

It will also be noted that these significance testing so 
schemes all possess relatively low computational 
complexity, considering the fact that their performance is 
comparable to the best-known image coding algorithms. 
This feature seems in conflict with the well-known tenets of 
information theory that the computational complexity of a 65 
stationary source, i.e., source sample aggregates) increases 
as the coding efficiency of the source increases, as discussed 
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in the book by T Cover and J. Thomas entitled "Elements of 
Information Theory (John Wiley & Sons, Inc., New York 
(1991))." Stated another way, these coding schemes seem to 
have provided a breathing space in the world of simulta- 
neously increasing efficiency and computational complexity. 

It should be mentioned that an important characteristic 
possessed by the class of coders discussed immediately 
above is that of progressive transmission and embeddedness. 
Progressive transmission refers to the transmission of infor- 
mation in decreasing order with respect to its information 
content. In other words, the coefficients with the highest 
magnitudes are transmitted first Since all of these coding 
schemes transmit bits in decreasing bit plane order, this 
ensures that the transmission is progressive. Such a trans- 
mission scheme makes it possible for the bitstream to be 
embedded, i.e., a single coded file can used to decode the 
image at various rates less than or equal to the coded rate, 
to give the best reconstruction possible with the particular 
coding scheme. 

With these desirable features of excellent performance 
and low complexity, along with other characteristics such as 
embeddedness and progressive transmission, it will be 
appreciated that these scalar quantized significance testing 
schemes have recently become very popular in the search for 
practical, fast and efficient image coders and, in fact, have 
become the basis for serious consideration for future image 
compression standards. 

It will be appreciated that while the significance testing 
schemes employing trees permits the exploitation of the 
similarity across different subbands and, thus, the clustering 
of energy in frequency and space in hierarchical structures 
of transformed images, such schemes do not make use of 
sets in the form of blocks. It will be appreciated that 
block-based coding is an efficient technique for exploiting 
the clustering of energy found in image transforms. It is a 
known fact that the statistics of an image transform vary 
remarkably as one moves from one spatial region to another. 
By grouping transform source samples in the form of blocks 
and coding those blocks independently, one advantageously 
can exploit the statistics of each block in an appropriate 
manner. It will be noted that this is one of the reasons that 
block-based coders work quite well. However, there is an 
increasing demand for some desirable properties for image 
coders, such as embeddedness and progressive transmission, 
which are very useful and much needed in the fast-growing 
MultiMedia and networking environment markets. Both the 
SWEET and the AGP block-processing algorithms, which 
are discussed below, are very efficient but do not possess 
these desirable properties. 

What is needed is a low-complexity hierarchical image 
coder and corresponding methods for coding/decoding 
blocks wherein the signal is completely embedded. 
Moreover, what is needed is a low-complexity hierarchical 
image coder and corresponding methods which employ 
progressive transmission. Furthermore, a low-complexity 
hierarchical image coder and corresponding method which 
have low dynamic memory requirements would be 
extremely desirable, particularly when the low-complexity 
hierarchical image coder is to be employed as a discrete 
module in a device having limited computing power. Lastly, 
a low-complexity hierarchical image coder and correspond- 
ing method which can be employed for both lossy and 
lossless compression would be particularly advantageous. 

SUMMARY OF THE INVENTION 

Based on the above and foregoing, it can be appreciated 
that there presently exists a need in the art for an embedded 
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and highly efficient low-complexity hierarchical image repeatedly store and release data representing blocks of an 

coder and corresponding method which mitigates the above- image being decoded to thereby permit the hierarchical 

described deficiencies. The present invention was motivated image coder to reconstruct an image larger in size than 

by a desire to overcome the drawbacks and shortcomings of available decoder computer memory, 

the presently available technology, and thereby fulfill this 5 

need in the art. BRIEF DESCRIPTION OF THE DRAWINGS 


One aspect according to mejpresent invention is provided ^ vafious olher feamres afld ^ of (he 

by a method for use in encoding an decoding a data set . .... ,., , . \ , , 

J n c li .v .uj-ij present invention will be readily understood with reference 

corresponding to an image. Preferably, the method includes . „ . 1 , . . 

- * . f ~.. . « . j to the following detailed description taken in conjunction 

a first subroutine for partitioning the data set into first and in . . . & . . . r . - 1 

~ j * e jj- .u a . . • . i- . p- • a . with the accompanying drawings, in which like or similar 

second sets, for adding the first set into a list of insignificant ( . r . , • . -. 

> /i ip\ 1 c • 1 ■ i-.c--n.-i numbers are used throughout, and in which: 

sets (LIS), and for initializing a list of significant pixels & ' 

(LSP), a second subroutine for testing the first and second FIG - 1 illustrates transforming f an image X into sets S 

sets for significance with respect to a threshold value, anc * ^ 

partitioning significant members of the first and second sets 15 FIGS. 2A and 2B collectively forma high-level block 

in accordance with first and second partitioning functions, diagram of the four subroutines executed in the SPECK 

respectively, and adding significant pixels to the LSP, a third algorithm according to the present invention; 

subroutine for refining the quantization of the pixels in the FIGS. 3A-3D are exemplary pseudocode listings of the 

LSP, and a fourth subroutine for decrementing the threshold functions called during execution of the subroutines illus- 

value. It will be appreciated that the term image encom- 20 trated in FIGS. 2A and 2B; 

passes a sequence of images which vary over time. FIG. 4 is useful in understanding the concept of quadtree 

Preferably, the second, third and fourth subroutines are partitioning a xl s int0 four subsels; 

repeated until encoding/decoding of the data set has been , . , . ,. , 

completed. Most preferably, the second, third and fourth . 5 understanding the concept of parti- 

subroutines are repeated until one of the encoding/decoding 25 tl0nm S a ^ 1 mt0 four new ^ 

of the data indicative of the lowest bit plane has been FIG - 6A is a graph comparing the results achieved using 

completed or the bit budget has been spent. Additionally, the the SPECK algorithm with those of other algorithms, while 

fourth subroutine advantageously can include a routine for ^OS. SB-6D are tables mapping performance to respective 

entropy coding a significance map generated in accordance standard test images; and 

with the second and third subroutines. 30 FIG. 7 is a high-level block diagram of a computer 

Another aspect according to the present invention is network capable of performing the SPECK algorithm illus- 

provide by a coder for use in encoding and decoding a data trated in FIGS. 2 A and 2B. 

set corresponds to an image, which includes a first device for nPTATl Fn nPSPRTPTinN OP THF 

phoning fte data ^formation into first and second ° PURRED eKdImS 

sets, for adding the first set into a list of insignificant sets 35 

(LIS), and for initializing a list of significant pixels (LSP), Illustrative embodiments and exemplary applications will 

a second device for testing the first and second sets for now be described with reference to the accompanying 

significance with respect to a threshold value, partitioning drawings to disclose the advantageous teachings of the 

significant members of the first and second sets in accor- present invention. 

dance with first and second partitioning functions, 40 While the present invention is described herein with 

respectively, and adding significant pixels to the LSP, and a reference to illustrative embodiments for particular 

third means for refining the quantization of the pixels in the applications, it should be understood that the invention is not 

LSP. Preferably, the second and third devices are employed limited thereto. Those having ordinary skill in the art and 

seriatim as the threshold value is decremented until access to the teachings provided herein will recognize 

encoding/decoding of the data set has been completed. In 45 additional modifications, applications, and embodiments 

addition, the coder can include a fourth device for entropy within the scope thereof and additional fields in which the 

coding a significance map cooperatively generated by the present invention would be of significant utility, 

second and third devices. The image coding algorithm according to the present 

A still further aspect according to the present invention is invention, which hereinafter will be referred to as the Set 

provided by a recording medium storing computer readable 50 Partitioned Embedded bloCK coder (SPECK) algorithm, 

instructions for converting a general purpose computer into can be said to belong to this class of scalar quantized 

an hierarchical image coder, wherein the image coder is a significance testing schemes. Thus, the SPECK algorithm 

low-complexity image coder which generates a selectively has its roots primarily in the ideas developed in the Set 

embedded bit stream suitable for progressive transmission. Partitioning in Hierarchical Trees (SPIHT), Alphabet & 

Preferably, the computer readable instructions permit the 55 Group Partitioning (AGP) and SWEET image coding algo- 

general purpose computer to repeatedly store and release rithms. The AGP algorithm was first disclosed in the paper 

data representing blocks of a image being encoded to by A. Said and W. Pearlman entitled "Low-complexity 

thereby permit the hierarchical image coder to process an Waveform Coding via Alphabet and Sample-set Partitioning 

image larger in size than available coder computer memory. (Visual Communications and Image Processing '97, Proc. 

Another aspect according to the present invention pro- 60 SPIE 3024, pp. 25-37 (February 1997))/' The SWEET 

vides a recording medium storing computer readable algorithm was described in the article by J. Andrew entitled 

instructions for converting a general purpose computer into "A Simple and Efficient Hierarchical Image Coder (1CIP 97, 

an hierarchical image decoder, wherein the image decoder is IEEE Int'l Conf. on Image Proc. 3, pp. 658, Paper No. 573 

a low<:omplexity image decoder which reconstructs an (1997))." It will be appreciated that the SPECK algorithm is 

image from a selectively embedded bit stream transmitted 65 different from SPIHT and SWEET in that it partitions 

by progressive transmission. Preferably, the computer read- blocks, not trees, and is different from AGP in that it is 

able instructions permit the general purpose computer to embedded. The main idea is to exploit the clustering of 
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energy in frequency and space in hierarchical structures of 
transformed images. 

A more detailed discussion of the SPECK algorithm with 
respect to these other algorithms, and coders employing the 
same, is given immediately below. However, it should first 
be mentioned at this point that the SP1HT and AGP algo- 
rithms are fully described, respectively, in U.S. Pat. No. 
5,764,807, which is entitled Data Compression Using Set 
Partitioning in Hierarchical Trees, and which issued to W. A. 
Pearlman and A. Said on Jun. 9, 1998, and U.S. Pat. No. 
5,959,560, which is entitled Data Compression via Alphabet 
Partitioning and Group Partitioning, and which issued to A. 
Said and W A. Pearlman on Sep. 28, 1999. It should also be 
mentioned that both of the above-mentioned patents are 
incorporated in their entirety by reference. 

The SPECK algorithm was motivated by the desire to 
provide a single algorithm having all of the features dis- 
cretely apparent from analysis of the SPIHT, SWEET and 
AGP algorithms. However, although the SPECK algorithm 
employs ideas from all of these coding schemes, the SPECK 
algorithm is different from these coders in various respects. 
In particular, the SPECK algorithm according to the present 
invention is a new image compression algorithm that is a 
novel synthesis of the AGP and SPIHT algorithms, particu- 
larly with respect to embedded coding, i.e., a single bit 
stream that allows progressive decoding of lower rate 
images by repeated truncation. It will be appreciated that this 
capability is important for delivery of different quality 
services to different users from a single encoding operation. 

In contrast, AGP works on the principle of recursive 
quadrature splitting of significant sub-blocks, where a sub- 
block is significant if its maximum value equals that of the 
four sub-blocks; otherwise, it is insignificant. In other 
words, AGP partitions blocks and encodes subblock 
maxima. A redefinition of maximum value as the deepest 
significant bit (floor (log 2 (maximum value)), and progres- 
sively decrementing the global deepest significant bit by one 
enables bit-plane embedded coding. In contrast, the SPECK 
algorithm employs the same lists, i.e., a list (LIS) consisting 
of insignificant sets of points, a list (LSP) consisting of 
significant points and a list (LIP) consisting of insignificant 
points to store pointers to insignificant pixels, insignificant 
sets (blocks), and significant pixels, respectively. However, 
as discussed below, LIP and LIS are combined in the SPECK 
algorithm in a way that retains the functionality of separate 
lists. The order in which pixels are tested, distributed to 
these lists, and found significant is duplicated at the decoder, 
so that bit planes of significant pixels are decoded in a 
progression from highest to lowest bit planes until the bit 
budget is spent. The creation of the above-mentioned lists in 
the encoder is enabled by the binary outcomes of the 
significance tests, which are sent to the decoder where they 
are duplicated. 

Before discussing the SPECK algorithm in detail, it 
should be mentioned that U.S. Pat. No. 5,748,116, which is 
entitled System and Method for Nested Split Coding of 
Sparse Data Sets, and which issued to C. IC Chui and R. Yi 
on May 5, 1998, describes a coding algorithm called NQS, 
which also employs a quadrature splitting method. NQS 
uses the so-called deepest bit paradigm, but without subse- 
quent coding, and does not produce an embedded bit stream. 
Moreover, the NQS algorithm does not employ lists by 
which the sets are sorted and ordered. It should be men- 
tioned that AGP is much more general, as it includes 
methods to encode the significance bits. i 

As discussed above, the SPECK algorithm combines 
aspects of AGP, which is non-embedded, and which works 
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on a quadtree with intermediate nodes associated with sets 
and only leaf nodes with pixels, with SPIHT, which is 
embedded, and which works on spatial-orientation trees, 
which are quadtrees having a different structure, i.e., having 
intermediate nodes associated with pixels or sets. Unlike any 
of the algorithms discussed above, the SPECK algorithm 
partitions the sets in increasing size to obtain a finer scale of 
embeddedness and, thus, more efficient compression. 
Whereas SPIHT is a tree -based fully embedded coder which 
employs progressive transmission by coding bit planes in 
decreasing order, the AGP and SWEET coding algorithms 
are block-based coders which are not embedded and do not 
employ progressive transmission. 

With respect to the SPIHT algorithm, SPIHT works by 
grouping pixels together in the form of spatial orientation 
trees. It is well known that a subband pyramid exhibits 
similarities across its subbands at the same spatial orienta- 
tion. The SPIHT algorithm exploits this characteristic of the 
image transform by grouping pixels together into such 
structures. 

The AGP algorithm, on the other hand, is a block-based 
coding algorithm which partitions the image transform in the 
form of blocks. The blocks are recursively and adaptively 
partitioned such that high energy areas are grouped together 
into small sets whereas low energy areas are grouped 
together in large sets. Such a type of adaptive quadtree 
partitioning results in efficient coding of the source samples. 
It should be mentioned that the AGP algorithm is a block 
entropy coder which first reduces the alphabet of the image 
transform to a tractable level. The AGP algorithm then 
partitions the image transform in the form of blocks and 
codes those blocks using a powerful entropy coding method. 

The SWEET coding algorithm is also block based and 
uses octave -band partitioning to exploit the pyramidal struc- 
ture of image transforms. SWEET codes a block up to a 
certain bit -depth, n w>1 , before moving onto the next one. 
Different rates of compressed images are obtained by appro- 
priately choosing the minimum bit-plane, n^^, to which to 
encode. Finer sets of compression ratios are obtained by 
scaling the image transform by some factor prior to the 
coefficient coding. 

As mentioned above, block-based coding is an efficient 
technique for exploiting the clustering of energy found in 
image transforms because the statistics of an image trans- 
form vary remarkably as one moves from one spatial region 
to another. Thus, by grouping transform source samples in 
the form of blocks and then coding those blocks 
independently, one advantageously can exploit the statistics 
of each block in an appropriate and advantageously manner. 
Since there is an increasing demand for some desirable 
properties for image coders, such as embeddedness and 
progressive transmission, and since both SWEET and AGP, 
although very efficient, do not possess these desirable 
properties, a different block-based coding algorithm would 
be particularly advantageous. 

The SPECK coding algorithm solves this problem by 
exhibiting the above-mentioned important properties, which 
are lacking in most, if not all, block-based coding algo- 
rithms. First, the SPECK algorithm provides a fully embed- 
ded block-based coder which employs progressive transmis- 
sion by coding bit planes in decreasing order. Second, the 
SPECK algorithm employs the octave-band partitioning of 
SWEET to exploit the hierarchical structure of the subband 
pyramid and concentrate more on potentially high energy 
subbands. Thirdly, the SPECK algorithm employs the adap- 
tive quadtree splitting methodology associated with the AGP 
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algorithm to zoom into high energy areas within a region and become obvious later on, an array of smaller lists of type 

to subsequently code these areas with minimum significance LIS, each containing sets of type S of a fixed size, advan- 

maps. Moreover, the SPECK algorithm employs the signifi- tageously can be employed instead of using a single large list 

cance map schemes of EZW and SPIHT to code the image having sets S of varying sizes. It will be appreciated that the 

transform progressively in decreasing bit-plane order. All of 5 use of multiple lists will speed up the encoding/decoding 

these features make SPECK a very efficient block-based process. 

embedded image coding algorithm. As mentioned immediately above, the SPECK algorithm 

In the discussion that follows, a brief outline of the employs use of only two lists, LIS and LSP, unlike SPIHT 

SPECK coding scheme and an explanation of the terminol- which makes use of an additional list, LIP, that stores 

ogy used in connection with the SPECK algorithm will first i° insignificant coefficients. It will be appreciated that the 

be presented. Thereafter, a detailed explanation of the contents of both US and UP are integrated into a single list 

SPECK algorithm will be provided. LIS in the SPECK algorithm. Hence, when reference is 

Referring to FIG. 1, consider an image X which has been made t0 ^ LIS "» this algorithm, it will be recognized 

adequately transformed using an appropriate data lhat both the ^^ificant sets and msigmficant coefficients 

transformation, e.g., by employing the discrete wavelet 15 arc bcm S rcferrcd t0 - ^ motivation behind the use of a 

transform. The transformed image can be said to exhibit a ™& Ust LIS ™ tDC SPECK algorithm is functionally 

hierarchical pyramidal structure defined by the levels of similar 10 maintaining separate lists, LIS and UP, as id 

decomposition, with the topmost level being the root. The SPIHT ^ ldca ^hind the strategy of processing sets in 

finest pixels lie at the bottom level of the pyramid while the increasing order of their size is that during the course of its 

coarsest pixels are found at the top (root) level. The image 20 execution, the algorithm sends those coefficients to LIS 

X advantageously can be represented by an indexed set of whose immediate neighbors have tested significant against 

transformed coefficients { C// }, located at pixel position (ij) ™™ threshold n but they themselves have not tested 

in the transformed image. significant against that particular threshold. Chances are, 

„ c , ! , j* .u - * u*u because of energy clustering in the transform domain, that 

Preferably, these pixels are grouped together in sets which , ■ j i 

. ■ *u . t j r? ii *u 25 these insignificant coefficients would have magnitudes close 

comprise regions in the transformed image. Following the , ^ . , c , . .... ™- . , , 

,, r , , ° - t j oniir-T- f *i_ ** i_ to the magnitudes of their neighboring coefficients already 

methodology associated with the SPIHT algorithm, it can be , . B . e . , T . . . ,., , . 

■ • .1 . . rr. r ' i * ■ 'c » *u * . tested significant, although lesser. In short, it is likely lhat 

said that a set T of pixels is significant with respect to n if . • • . , 

these coefficients will test positive to some nearby lower 

threshold and add to the reduction in the overall distortion of 

( gj?r ,k * ,) * T 30 the coded image. 

Having established and defined the terminology will be 

employed in discussing the SPECK coding method, the 

otherwise it is insignificant. Moreover, the significance of a SPECK algorithm will now be described in detail. 

set T can be defined as a function of n and the set T, i.e., SPECK coding algorithm is illustrated in FIGS. 2A 

35 and 2B. As will be appreciated from these figures, the actual 

!i' it T z^WiJ<2 n + l SPECK algorithm consists of an initialization subroutine 

Q ' €r 100, a partitioning subroutine or pass 200, a refinement 

e * e subroutine or pass 300, and the quantization subroutine 400. 

FIGS. 3A-3D illustrate, in pseudocode listing, the specific 

It should be mentioned that the SPECK algorithm makes ™ functions called during performance of the subroutines 100, 

use of rectangular regions of image. These regions or sets, 200, 300, and 400. These four functions are ProcessS( ), 

which will henceforth be referred to as sets of type S, CodeS( ), ProcessI( ), and CodeI( ) in FIGS. 3A-3D. 

advantageously can vary dimensionally. In other words, the The detailed working of the SPECK algorithm presented 

dimension of a set S depends on the dimension of the in FIGS. 2A and 2B will now be described. It will be noted 

original image and the subband level of the pyramidal 4 * that the starting source for the SPECK algorithm is a 

structure at which the set lies. rectangular image X that has undergone an appropriate data 

Moreover, the size of a set can be defined to be the transformation. As illustrated in FIG. 1, the image X consists 

cardinality C of the set, i.e., the number of elements (pixels) of transformed coefficients {c iy }, located at pixel position 

in the set, as: I* will be appreciated that such an image exhibits a 

50 hierarchical pyramidal structure having subbands at differ- 

«ize(S) ( ^Pl cnt j cvc i s 0 f ; ts decomposition. The topmost band is referred 

During the execution of the SPECK algorithm, sets of to as the root of the pyramid, 

various sizes will be formed, depending on the characteris- The SPECK algorithm subroutine 100 starts by calculat- 

tics of pixels in the original set. It will be noted that a set of ing the value n,^ during step S101, which value is then 

size 1 consists of just one pixel. The other type of sets used 55 output to a memory device or a communications channel, as 

in the SPECK algorithm are referred to as sets of type I. discussed below, during step S102. It will be appreciated that 

These sets are obtained by chopping off a small square the maximum threshold n,^ is the value which satisfies the 

region from the top left portion of a larger square region, as criteria c,y is insignificant with respect to n mox +l, Vc,y eX, 

illustrated in FIG. 1. but is significant against the threshold n max , for some c^-eX. 

It should also be mentioned that during execution of the 60 Then, the data representing the image X is formed into sets 

SPECK algorithm two linked lists, i.e., LIS — List of Insig- by partitioning the transform corresponding to the image 

nificant Sets, and LSP — List of Significant Pixels, are main- into two sets, i.e., a set S, which is the root of the pyramid, 

tained. The former list contains the sets of type S of varying and a set I, which is everything that is left of the image after 

sizes which have not yet been found significant against a taking out the root (see FIG. 1), during step 103. During step 

predetermined threshold n. In contrast, the latter list advan- 65 S104, set S is initialized LIS and LSP is set equal to 0. 

tageously contains those pixels which have tested significant During the partitioning subroutine 200, the SPECK algo- 

against the predetermined threshold n. Alternatively, as will rithm performs quadtree partitioning of the image. More 


02/05/2004, EAST Version: 1.4.1 


US 6,671,413 Bl 

9 10 

specifically, during step S2 10, set S in US is processed by the size of each of these three sets S is the same as that of 

testing it for significance against the threshold n in accor- the chopped portion of X. The new set I that is formed by 

dance with the function labeled ProcessS(S), which is illus- this partitioning process is thus reduced in size, 

trated in FIG. 3 A- Assuming that set S is significant, it is More specifically, for each 1, a test is performed to 

partitioned, by a quadtree partitioning process, into four 5 determine if I is significant during step S221. The results of 

subsets 0(S), each having a size approximately one-fourth this determination are output to the communications channel 

the size of the parent set S in accordance with the function during step S222. If I is not significant, no further action is 

labeled CodeS(S), as illustrated in FIG. 3B. FIG. 4 illustrates taken, and the next I is checked for significance during step 

this particular partitioning process. S223. If I is significant, I is partitioned into three S-type sets 

More specifically, during step S211, the set S is tested to 10 and a new set I during step 224. For each S, the subroutine 

see if it is significant or not. If S is not significant, indicated jumps back to S211, the start of subroutine 210. For each I, 

by a value of 0, S is then tested to determine whether S is the subroutine 220 is repeated, i.e., the subroutine jumps 

not an element of LIS during step S213. If the answer is true, back to the start of step S231. 

S is added to US. If the determination in step S 211 is that It should be mentioned at this point that the principle 
is S is significant, a test is performed to determine if S is 15 behind this partitioning methodology is exploitation of the 
included in LIS. If the answer is negative, the subroutine hierarchical pyramidal structure of the subband 
jumps to the input of step S217. When the answer is YES, decomposition, where it is more likely that the energy is 
S is removed from LIS during step S216 and a further test concentrated at the top most levels of the pyramid and, as 
is performed to determine where the size of S corresponds one goes down the pyramid, the energy content decreases 
to a pixel during step S217. If the answer is YES, S is added 20 gradually. Thus, if a set I is significant against some thresh- 
to LSP during step S218 and the sign is output to, for old n, it is more likely that the pixels that cause I to be 
example, the communications channel during step S219. If significant lie in the top left regions of the set I. These 
the answer is NO, the subroutine 230 is executed, i.e., the regions are decomposed into sets of type S, and are put next 
routine CodeS(S) illustrated in FIG. 3B is executed. In either in line for processing. 

case, the determination made during step S211 is output 25 In this way, regions that are likely to contain significant 

during step S212. pixels are grouped into relatively smaller sets and processed 

During subroutine 230, the set S is partitioned into four first, while regions that are likely to contain insignificant 

equal sets O(S) during step S231. Each set O(S) is ten tested pixels are grouped into a large set. Advantageously, a single 

to determine whether or not it is significant during step bit may be enough to code this large region against the 

5233. In either case, the determination result is output to the 30 particular threshold. Hence, once the set I is partitioned by 
communications channel during step 232. If the 0(S) is not the octave band partitioning method, the three sets S are 
significant, 0(S) is added to LIS during step S235. If O(S) processed in the regular image -scanning order, after which 
is significant, the size of O(S) is determined during step the newly formed reduced set I is processed. 

5234. When O(S) is equal in size to a pixel, O(S) is added It should be noted that processing the set I is a recursive 
to LSP during step S236 and the sign is output to the 35 process, and depending on the characteristics of the image, 
communications channel during step S237. When 0(S) is at some point in the algorithm, the set I will cover only the 
larger than one pixel, the subroutine 230 is repeated. lowermost (bottom) level of the pyramidal structure, as 

Thus, each of these four subsets is treated, in turn, as a set indicated with respect to step S230. When, at this point, the 

of type S and processed recursively until the pixel-level is set I tests significant against some threshold, it will be 

reached where all of the pixels that are significant in the 40 broken down into three sets S but there will be no new 

original set S are located and thereby coded. It should be reduced set I. To be precise, the new set 1 will be an empty 

noted that the pixels/sets that are found to be insignificant set. Hence, the functions ProcessI(S) and Codel(S) will have 

during this 'hunting' process are added to LIS to be tested no meaning in the SPECK algorithm after this event, 

against the next lower threshold during a subsequent stop in It should also be mentioned that an important step in the 

the SPECK algorithm. The motivation for quadtree parti- 45 execution of the partitioning pass comes after the first run of 

tioning of such sets is to rapidly focus on the areas of high the algorithm. Once one pass has occurred, sets of type S of 

energy in the set S and code these areas first. An equally varying sizes are added to US. During the next lower 

important motivation for quadtree partitioning of such sets is threshold, these sets are processed in a particular order. The 

identify large insignificant regions, which are relatively list LIS is not traversed sequentially for processing sets S; 

common in image transforms, since insignificant sets at a 50 rather, the sets are processed in increasing order of their size, 

threshold n corresponding to such regions advantageously In other words, say for a square image, sets of size 1 (i.e., 

can be specified by a single bit. pixels) are processed first, sets of size 4 (blocks of 2x2 

At this stage in the execution of the SPECK algorithm, pixels) are processed next, and so on. 

there are no more sets of type S that need to be tested against The idea behind this strategy is that during the course of 

the threshold n. It will be appreciated from the discussion 55 its execution, the SPECK algorithm sends those pixels to 

above that if there were untested sets of the type S, these sets LIS whose immediate neighbors have tested significant 

would be processed before going on to the next part of the against some threshold n but they themselves have not tested 

SPECK algorithm. Thus, once all sets of type S are significant against that particular threshold. Chances are, 

processed, the set I is processed by testing it against the same because of energy clustering in the transform domain, that 

threshold n in accordance with a function labeled ProcessI 60 these insignificant pixels would have magnitudes close to 

(S), which is illustrated in FIG. 3C, during subroutine 220. but less that the magnitudes of their neighboring pixels 

If the set I is found to be significant, the set I is partitioned already tested significant. Thus, it is likely that these pixels 

by yet another partitioning scheme, i.e., octave band parti- will test positive to some nearby lower threshold and add to 

tioning. FIG. 5 illustrates this partitioning scheme. Set I is the reduction in the overall distortion of the coded image, 

partitioned into four sets, i.e., three sets of type S and one set 65 Moreover, the overhead involved in testing a single pixel 

of type I in accordance with the function labeled Codel(S), and moving it to LSP is much lower than that involved in 

which is illustrated in FIG. 3D. It should be mentioned that testing a group of pixels and moving the significant ones in 
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it to LSR Of course, if a whole partitioning pass is 
completed, this scheme offers no advantage since all the sets 
of type S in LIS would be tested in either case. However, if 
the coding algorithm were to stop in the middle of a 
partitioning pass, as it might if the desired rate is achieved, 5 
and the sets in LIS are processed in increasing order of their 
size, then the SPECK algorithm will provide a significant 
performance improvement. 

It may seem that processing sets of type S in increasing 
order of their size involves a sorting mechanism, something 
which is not desirable in fast implementation of coders. 
However, there is a simple way of completely avoiding this 
sorting procedure. 

Note that the way sets S are constructed, they lie com- 
pletely within a subband. Thus, every set S is located at a 
particular level of the pyramidal structure. Partitioning a set 15 
S into four offsprings O(S), i.e., forming sets S of a new 
reduced size, is equivalent to going down the pyramid one 
level to a correspondingly finer resolution. Hence, the size of 
a set S for an arbitrary image corresponds to a particular 
level of the pyramid. Advantageously, the SPECK algorithm 20 
employs an array of fists, each corresponding to a level of 
the pyramid, i.e., each list stores sets of a fixed size. 
Processing the lists in an order which corresponds to 
increasing size of sets completely eliminates the need for 
any sorting mechanism for processing the sets S. Thus, the 25 
SPECK algorithm does not need to compromise the speed of 
the algorithm by employing some kind of sorting mecha- 
nism. 

It should be noted that using an array of lists does not 
increase the memory requirements for the coder, as opposed 30 
to using a single list. This is because the total number of sets 
S that are formed during the coding process remain the 
same. Instead of storing these sets in one large list, we are 
storing them in several smaller lists with the aim of speeding 
up the coding process. 35 

Once all the sets S and I have been processed, the 
partitioning subroutine 200 for that particular threshold n is 
completed, after which the refinement subroutine 300 is 
initiated. The subroutine 300 refines the quantization of the 
pixels in LSP, i.e., those pixels which were tested significant 40 
during the previously executed partitioning subroutine 200. 
It should be mentioned that once the subroutine 300 has been 
performed, the threshold n is lowered and the sequence of 
partitioning and refinement subroutines 200 and 300 is 
repeated against this lower threshold n. This process is 45 
repeated until the desired rate is achieved or, in case of 
lossless or nearly lossless compression, all the thresholds up 
to the last, corresponding to threshold n«0, are tested. 

Further quantization, and possibly entropy coding of 
significance map using arithmetic coding with simple 50 
context-based models, is then performed during subroutine 
400. Moreover, during step S420, the threshold □ is decre- 
mented by one and this threshold is examined during step 
S430 to determine whether or not the n has fallen below a m£n 
or the bit budget has been exhausted. If n is not below n TO -„, 55 
subroutines 200 and 300 are repeated. If the threshold is 
below n min , the SPECK algorithm ends. 

The performance of the SPECK algorithm according to 
the present invention was determined with respect to several 
monochrome, 8 bpp, 512x512 images. In each case, 5-level 60 
pyramids constructed with the 9=7 tap biorthogonal filters 
were employed. Moreover, the images were pre-processed 
using a reflection extension at the image edges. It will be 
noted that the bit rales are calculated from the actual size of 
the compressed files. Since the codec is embedded, the 65 
results for various bit rates are obtained from a single 
encoded file. 
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FIGS. 6A-6D show the PSNR obtained by this coding 
method at the rates 0.25, 0.5 and 1.0 bpp for the three images 
'Lena*, * Barbara* and 'GoldhilT. These results are obtained 
by entropy-coding the bits put out by the coding scheme. 
Similar results are also included for the EZW, SPIHT and 
AGP coding schemes, which also employ entropy coding of 
the significance map. In particular, FIG. 6A presents the 
rate -distortion curves for the three images at rates up to 1 
bpp while FIGS. 6B-6D present these comparisons in tabu- 
lar form. The results are of the same nature as mentioned 
above for the three coding schemes. It will be appreciated 
from the numerical results, the SPECK coding scheme 
provides excellent results, comparable to the popular image 
coding schemes, such as SPIHT. For some images, such as 
Barbara, the SPECK algorithm actually gives slightly better 
performance than SPIHT. 

Referring back to the discussion of the SPECK algorithm, 
it will be appreciated that in the function CodeS(S), the 
significance test results of the four subsets 0(S) of set S are 
not coded separately. Rather, they are all first coded together 
before further processing of the subsets. The SPECK algo- 
rithm employs conditional coding for coding the signifi- 
cance test result of this four-subset group. In other words, 
the significance test result of the first subset is coded without 
any context, while the significance test result of the second 
subset is coded using the context of the first coded subset, 
and so on. In this way, previously coded subsets form the 
context for the subset being currently coded. 

Also, it should be mentioned that the SPECK algorithm 
makes use of the fact that if a set S is significant and its first 
three subsets are insignificant, then this ensures that the 
fourth subset is significant and, thus, the SPECK algorithm 
does not have to send the significance test result of the last 
subset. This fact advantageously can be utilized in reducing 
the bit budget. Results have shown that because of the nature 
of energy clustering in the pyramidal structure, the number 
of scenarios of the above-mentioned type occur slightly 
more if the four-group subsets are coded in reverse-scanning 
order than in the usual forward-scanning order. This saves 
some overhead in bit budget and provides corresponding 
gains. 

It should be mentioned that the decoder employs the same 
methodology as the encoder. In other words, the decoder 
receives significance test results, e.g., output during steps 
S212, S219, S232, S237, and S222, as a coded bitstream and 
reconstructs the same list structure produced by the encoder 
during the execution of the SPECK algorithm. It will be 
appreciated that the value r^^ is received by the decoder 
rather than calculated. Hence, the decoder advantageously 
can follow the same execution paths for the significance 
tests of the different sets and, thus, reconstructs the image 
progressively as the algorithm proceeds. 

In summary, the SPECK image coding method has all the 
properties characteristic of scalar quantized significance 
testing schemes. In particular, the SPECK method exhibits 
the following properties: 

It is completely embedded — a single coded bitstream can 
be used to decode the image at any rate less than or 
equal to the coded rate, to give the best reconstruction 
of the image possible with the particular coding 
scheme. 

It employs progressive transmission — source samples are 
coded in decreasing order of their information content. 

It has low computational complexity — the algorithm is 
very simple, consisting mainly of comparisons, and 
does not require any complex computation. 

It has low dynamic memory requirements — at any given 
time during the coding process, only one connected 
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region (lyi n g completely within a subband) is pro- an I/O device 918. Although the computers 700 and 710 are 
cessed. Once this region is processed, the next region is illustrated in such a way that one could concluded that these 
then considered for processing. computing devices are substantially similar, this need not be 
It has fast encoding/decoding — this is due to the low- the case. For example, in FIG. 7, the computer 700 advan- 
complexity nature of the algorithm. 5 tageously could be a personal computer whereas the com- 
It has efficient performance — its efficiency is comparable puting device 710 advantageously could be a telephone or a 
to the other low-complexity algorithms available today. personal digital assistant (PDA) device. Moreover, it will be 
It can be used for lossy and lossless compression — appreciated that the CPU advantageously can be replaced by 
depending on the choice of the transform, the algorithm a digital signal processor (DSP). Preferably, the communi- 
can be used for lossless or nearly lossless coding, apart 10 cations channel 720 connotes any type of data path which 
from lossy coding. permits computer 700 to transfer data representing an image 
It will be appreciated that image compression is used to computer 710. 
almost everywhere where images are stored and/or trans- It will be appreciated that the individual programming 
mitted. Applications include, but are not limited to Internet steps required to implement the techniques of the present 
image transmission, digital photography, telemedicine, par- 15 invention will be apparent to one of ordinary skill in the art 
ticularly teleradiology, and PACS medical workstations. The in view of the discussion presented herein. 
SPECK algorithm is applicable to all of the foregoing One anticipated requirement of the next generation corn- 
applications, as well as to remote sensing, cartography, pression standard is the inclusion of a stringent memory 
printing, and scanning equipment. constrained property. What this effectively means is that at 
Although the techniques of the present invention have 20 any given time during the coding process, the coder should 
been described herein with respect to image processing, use only a limited amount of memory, which should be 
other forms of data may be processed. Any data set that may independent of the size of the image. If a coding scheme 
be transformed through subband decomposition may subse- does not conform to this requirement, applications that 
quently have the transform coefficients coded for transmis- require very large images on machines that have limited 
sion and/or storage using the disclosed techniques. For 25 amount of workspace memory will find it impractical to use 
example, a digitized audio segment may be decomposed into such a coding scheme. Hence, it is vital to have the coder use 
frequency subbands and encoded as described herein. It will limited workspace memory, independent of the size of the 
be appreciated that the audio subband blocks (sequences) are image. 

recursively partitioned into two subblocks to form a binary It should be mentioned that the wavelet transformation 
tree (bintree) structure instead of the quadtree structure 30 property that the SPECK algorithm utilizes transforms the 
employed with two-dimensional data sources. Furthermore, entire image at once. Hence, the memory needed is of the 
the coding techniques of the present invention may be order of the image size. In order to implement low-memory 
applied to various types of subband decompositions with coders, it will be a appreciated that the wavelet transform 
their associated filters), e.g., to coding discrete cosine module must be replaced by either a line-based or a block- 
transform (DCT) blocks used in conjunction with MPEG 35 based wavelet transform. It will also be appreciated that in 
image processing, as discussed in greater detail below. the line-based transform, the entire image is not transformed 
The hardware elements of a computer system used to at once. Rather, the image is read line -by-line until enough 
implement the techniques of the present invention are shown lines have been read in to perform a wavelet transformation 
in FIG. 7. A computer 700 includes a central processing unit with the particular wavelet filter. This generates a few lines 
("CPU") 706 providing main processing functionality. A 40 of the transform coefficients corresponding to the lowest 
memory 704 is coupled to CPU 906 for providing opera- subband levels. These lines of transformed data advanta- 
tional storage of programs and data. Memory 704 may geously can then be processed and then discarded from 
comprise, for example, random access memory ("RAM") or memory. Thus, this process of transforming lines of input 
read-only memory ("ROM"). Nonvolatile storage of, for data to generate lines of transform coefficients conserves 
example, data files and programs is provided by a storage 45 memory. 

702 that may comprise, for example, disk storage. Both It will be a appreciated that the encoder according to one 

memory 704 and storage 702 comprise a computer useable low-memory alternative embodiment according to the 

medium that may store computer program products in the present invention can keep accumulating these lines of 

form of computer readable program code. User input and coefficients in memory until enough data has been gathered 

output is provided by an input/output ("I/O") device 708. I/O 50 to perform the SPECK algorithm. Once that data has been 

facility 708 may include, for example, a graphical display, a processed, the data can be discarded, i.e., the memory space 

mouse and/or a graphics tablet. As an example, the computer can be freed for other purposes. A similar but reverse 

system 700 of FIG. 7 may comprise an International Busi- procedure occurs during the decoding process. This elimi- 

ness Machines RISC System/6000 computer executing an nates the need for an image to be stored in memory all at 

AIX operating system. 55 once. 

Moreover, as illustrated in FIG. 7, a second computer 710 In contrast to the octave-band partitioning process 

advantageously can be connected to the computer 700 via a employed by the original SPECK algorithm discussed in 

communications channel 720 to form a computer or com- detail above, the partitioning process in the low-memory 

municalions network. It will be appreciated that network is embodiment of the present invention version advanta- 

used in its broadest sense, i.e., to denote a collection of 60 geously can be predetermined. This alternative embodiment 

machines that communicate with one another. Thus, while a advantageously can employ non-overlapping fixed-size 

local area network (LAN) is one form of network, the blocks within subbands of transforms, that are tiled up to 

Internet, the plain old telephone system (POTS), to name but cover the entire image. Typical block size ranges from 

a few, are all networks and, thus, are considered to fall 32x32 to 128x128, with 64x64 being the most common. It 

within the scope of the hardware implementation of the 65 will be noted that the blocks are coded, independently of 

present invention. In one preferred embodiment, the com- each other, into packets. Once a block is coded, the coded 

puter 910 includes storage 912, memory 914, a CPU 916 and data is stored into a packet that is placed in a buffer memory, 
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e.g., RAM. These packets, one for each coded block, pref- 
erably are stored in the buffer until all the blocks in the 
image transform have been processed or the desired rate 
target has been met. 

It should be mentioned that the way in which the blocks 
are coded helps in reducing the amount of memory used by 
the coder. Transformed coefficients are passed on a line-by- 
line basis to the coder. As soon as a block is filled up with 
coefficients, the block is encoded using the SPECK algo- 
rithm. As previously mentioned, once the block is coded, the 
contents of the block are released from memory. In this way, 
blocks are coded as soon as they are filled up and then 
discarded from memory. This greaUy reduces the amount of 
memory needed for the whole coding system. 

It should also be mentioned that the blocks arc coded in 
the same way as the SPECK algorithm codes sets of type S. 
Recursive quadtree partitioning advantageously can be used 
to test and partition the block being processed. Multiple lists 
are used to help keep track of the different sets and coeffi- 


15 


extra overhead redundancy and, hence, the coding perfor- 
mance would improve. Simply concatenating the coded 
packets together will result in a block-embedded bitstream. 

In short, one of the most important implications of the 
SPECK algorithm is its potential for constructing low- 
memory usage coders. Although there are a few other coders 
meeting the same requirements, yet the key characteristic of 
the coder according to the present invention, which makes 
the SPECK algorithm stand apart from all of the rest, is the 
SPECK algorithm's low-complexity. This ensures very fast 
coding speeds, apart from exhibiting very good coding 
performance. 

As discussed in detail above, the SPECK algorithm has 
proved to be a very effective image coding scheme. The 
SPECK algorithm is a block-based hierarchical coder which 
makes use of image statistics within a subband. It does not 
exploit inter-subband correlation and codes sub bands inde- 
pendently of each other. SPECK has been seen to perform 
well for all types of natural images, but particularly well for 
high frequency images. The SPECK algorithm itself oper- 


cients in the block. It is interesting to note that the lists are 20 ateson each block independently. Coding of the significance 

now associated with the blocks instead of the entire image. map could employ the context significance states of pixels in 

Hence, even the size of the list is greatly reduced. Also, the other subbands, such as parent pixels in a spatial orientation 

lists are reset after the coding of each block. Thus, it can be tree structure. As mentioned above, entropy coding of sig- 

easily seen that all this contributes to a significant reduction nificance map is optional. The SPIHT algorithm, on the 

in memory usage, and the amount of memory needed for the 25 other hand, is also a very successful and efficient image 

entire coding system is on the order of the size of blocks coding scheme. It is, however, a tree-based hierarchical 

being processed. coder which employs image statistics across subbands. 

It will be appreciated that since the preferred embodiment Thus, the SPIHT algorithm exploits inter-subband correla- 

of the coder under discussion utilizes line-based wavelet tion and codes various subbands at the same time, making 

transforms, it is important to know the order in which blocks 30 use of their interdependency. SPIHT has also been seen to 


of fixed size are visited. Assuming that the image is tiled into 
64 blocks, with 8 blocks along each direction, a few lines of 
the original image will be read in until there are enough lines 
to do the transformation with the particular wavelet filter. It 
will be noted that this would result in a 1 -level decompo- 
sition of the image. 

It should be mentioned that since the subbands having the 
high frequency content will not be further decomposed, the 
coefficients corresponding to these subbands will be sent to 
the encoder. Hence, the Low-High, High-Low and High- 
High subbands, which are discussed in detail with respect to 
U. S. Pat. No. 5,764,807, will receive line coefficients. These 
subbands will keep receiving these lines of coefficients until 
enough lines have been received to fill up at least one block. 


35 


40 


perform well for all types of natural images, but gives 
slightly better performance than SPECK for low frequency 
images. It will be appreciated from the discussion above, 
particularly considering that the LIS employed in the 
SPECK algorithm corresponds to both the LIS and LIP of 
the SPIHT algorithm, that a hybrid coder where the low 
frequency data of the image are coded using SPIHT algo- 
rithm while the high frequency data of the image are coded 
using SPECK algorithm advantageously can be provided. 

Moreover, although the SPECK coding algorithm has 
been tested primarily on still images, there are many alter- 
native paths to extend the SPECK algorithm to video. 
According to one preferred embodiment of the present 
invention, the SPECK algorithm can be employed with 


Again, once a block is encoded, the coefficients belonging to 45 respect to video by employing a 3-D extension of the 

the block are discarded from memory, freeing up memory SPECK algorithm over a fixed or variable sequence of 

space. This process continues in a similar fashion until all images. It will be appreciated that a three dimensional (3-D) 

the blocks are filled up and coded. It will be noted that this wavelet transform advantageously can be applied over a 

is the order of coding blocks into packets. This order is sequence of images, and the whole image sequence can be 

changed to a predefined order, e.g., the regular raster scan- 50 partitioned in the form of cubes, instead of blocks, as is done 


ning order for blocks, before forming the final bitstream for 
transmission. It is important to note that the reverse order of 
visiting the blocks takes place at the decoder, since the 
decoder will perform line-based inverse wavelet transfor- 
mation. 

As mentioned earlier, blocks are coded independently into 
packets which are stored in buffer memory. The final bit- 
stream is generated by combining the packets in any arbi- 
trary manner. There is great exibility in this postprocessing 
stage of bitstream generation. Depending on the type of 
coding desired, the final bitstream can be made either 
perfectly embedded or block-embedded, i.e., embedded 
within blocks. It will be appreciated that the requirement of 
perfect embedded ness can be relaxed a little by making the 
bitstream block-embedded, which means that the data within 
a packet/block will be embedded but the entire bitstream 
will not be strictly embedded. This would make the use of 


for a single image frame. A simple extension of the 2-D 
SPECK algorithm to 3-D would code the image sequence in 
a manner analogous to the SPECK algorithm. 
In another exemplary embodiment applying the SPECK 
55 algorithm to video coding, the SPECK algorithm advanta- 
geously can be applied to the motion-compensated image 
sequences. It will be appreciated that the current MPEG 
standards, i.e., MPEG-1, MPEG-2, and MPEG-4, support 
the concept of macroblocks, whereby blocks of image are 
entropy coded after adequate motion compensation. These 
video coding standards are thought to be generic in the sense 
that they do not specify any particular entropy-coding 
scheme by which to encode these macroblocks. Hence, the 
block-based SPECK algorithm discussed above meshes very 
well into the various MPEG frameworks. 

Finally, it should be mentioned that while the SPECK 
coding algorithm has been tested primarily on gray-scale 
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monochrome images, it will be appreciated that color 
images advantageously can be processed in a similar man- 
ner. Color images can be thought of as being composed of 
three monochrome images, corresponding to either the lumi- 
nance and two chrominance components or red-green-blue 5 
(RGB) values, e.g., 24-bit true color. Hence, SPECK algo- 
rithm can be applied to each of these components or planes 
separately to code a color image. Moreover, additional 
coding efficiency advantageously can be achieved by taking 
advantage of the strong correlation among these different id 
color components. 

Although presently preferred embodiments of the present 
invention have been described in detail hereinabove, it 
should be clearly understood that many variations and/or 
modifications of the basic inventive concepts herein taught, 15 
which may appear to those skilled in the pertinent art, will 
still fall within the spirit and scope of the present invention, 
as defined in the appended claims. 

What is claimed is: 

1. A method for use in encoding and decoding a data set 20 
representing an image, comprising: 

a first subroutine for partitioning the data set into first and 
second sets, for adding the first set into a list of 
insignificant sets (LIS), and for initializing a list of 
significant pixels (LSP); 25 

a second subroutine for testing the first and second sets for 
significance with respect to a threshold value, parti- 
tioning significant members of the first and second sets 
in accordance with first and second partitioning 
functions, respectively, and adding significant pixels to 30 
the LSP; a third subroutine for refining the quantization 
of the pixels in the LSP; and 

a fourth subroutine for decrementing the threshold value, 

wherein the second, third and fourth subroutines are 35 
repeated until encoding/decoding of the data set has 
been completed. 

2. The method as recited in claim 1, wherein the fourth 
subroutine further comprises a routine for entropy coding a 
significance map generated in accordance with the second 40 
and third subroutines. 

3. The method as recited in claim 1, wherein members of 
the first set demonstrating significance with respect to the 
threshold value are partitioned employing a quadtree parti- 
tioning scheme. 45 

4. The method as recited in claim 3, wherein members of 
the first set demonstrating significance with respect to the 
threshold value are recursively partitioned employing a 
quadtree partitioning scheme. 
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5. The method as recited in claim 1, wherein members of 
the second set demonstrating significance are with respect to 
the threshold value partitioned employing octave band par- 
titioning. 

6. The method as recited in claim 5, wherein members of 
the second set demonstrating significance are with respect to 
the threshold value recursively partitioned employing octave 
band partitioning. 

7. The method as recited in claim 2, wherein the subrou- 
tine for entropy coding is performed using arithmetic cod- 
ing. 

8. The method as recited in claim 1, wherein the image 
comprises a sequence of images which vary over time. 

9. The method as recited in claim 1, wherein the second, 
third and fourth subroutines are repeated until one of the 
encoding/decoding of the data indicative of the lowest bit 
plane has been completed and the bit budget has been spent. 

10. A coder for use in encoding and decoding a data set 
corresponding to an image, comprising: 

first means for partitioning the data transformation into 
first and second sets, for adding the first set into a list 
of insignificant sets (US), and for initializing a list of 
significant pixels (LSP); 

second means for testing the first and second sets for 
significance with respect to a threshold value, parti- 
tioning significant members of the first and second sets 
in accordance with first and second partitioning 
functions, respectively, and adding significant pixels to 
the LSP; and 

third means for refining the quantization of the pixels in 
the LSP; 

wherein said second and third means are employed seria- 
tim as the threshold value is decremented until 
encoding/decoding of the data set has been completed. 

11. The coder as recited in claim 10, further comprising: 
fourth means for entropy coding a significance map 

cooperatively generated by the second and third means. 

12. The coder as recited in claim 10, wherein members of 
the first set demonstrating significance with respect to the 
threshold value are partitioned employing a quadtree parti- 
tioning scheme. 

13. The coder as recited in claim 10, wherein members of 
the second set demonstrating significance are partitioned 
employing octave band partitioning. 

14. The coder as recited in claim 11, wherein the fourth 
means performs entropy coding using arithmetic coding. 

***** 
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